A Modified Character Segmentation Algorithm for Farsi Printed Text Using Upper Contour Labelling

Authors

  • E. Kabir and R. Azmi
  • H. Nezamabadi-Pour
Abstract:

In this paper, a modified segmentation algorithm for printed Farsi words is presented. The algorithm builds on earlier work by Azmi, which uses conditional labeling of the upper contour to find the segmentation points. The main objective is to improve segmentation results for low-quality prints. To achieve this, local baseline detection, contour labeling, and the segmentation rules have been modified. In an experiment, the correct segmentation rate was 97%. Based on these results, a detailed error analysis is presented, which should be useful for further research on this topic.


Similar resources

Segmentation-free optical character recognition for printed Urdu text

This paper presents a segmentation-free optical character recognition system for printed Urdu Nastaliq font using ligatures as units of recognition. The proposed technique relies on statistical features and employs Hidden Markov Models for classification. A total of 1525 unique high-frequency Urdu ligatures from the standard Urdu Printed Text Images (UPTI) database are considered in our study. ...


A Chinese Character Segmentation Algorithm for Complicated Printed Documents

The character segmentation technology for printed documents plays an important role in optical character recognition, ticket information identification, postal code identification, automatic license plate recognition and so on. In this paper, a Chinese characters segmentation algorithm for complicated printed documents is proposed for the application in paper watermarking system. In this applic...



Journal title

volume 23, issue 1

pages 33-48

publication date 2004-07


